Intoduction: Handling the files: reading multiple csv files into the same dataframe-called RawData. Data will be organized into a specific template, some raws within the RawData will be extracted into a new dataframe - Those will be the data regarding the whole cells. This small dataframe will be organized, some data will be normalized and some will be added-dataframe is called CellsDataC. The rest, as well, will be assigned into a new dataframe- called Data2Handle: this data will be cleaned, reorganized and some rows will be eliminated-due to resolution issues.

library(ggplot2)
Want to understand how all the pieces fit together? Read R for Data Science:
https://r4ds.had.co.nz/
library(dplyr)

Attaching package: ‘dplyr’

The following objects are masked from ‘package:stats’:

    filter, lag

The following objects are masked from ‘package:base’:

    intersect, setdiff, setequal, union
library(tidyr)
library(tibble)
library(stringr)
library(reshape2)

Attaching package: ‘reshape2’

The following object is masked from ‘package:tidyr’:

    smiths
library(rlist)
Registered S3 method overwritten by 'data.table':
  method           from
  print.data.table     
library(readr)
library(purrr)
library(dgof)

Attaching package: ‘dgof’

The following object is masked from ‘package:stats’:

    ks.test
library(car)
Loading required package: carData

Attaching package: ‘car’

The following object is masked from ‘package:purrr’:

    some

The following object is masked from ‘package:dplyr’:

    recode
library(Rmisc)
Loading required package: lattice
Loading required package: plyr
-----------------------------------------------------------------------------------------
You have loaded plyr after dplyr - this is likely to cause problems.
If you need functions from both plyr and dplyr, please load plyr first, then dplyr:
library(plyr); library(dplyr)
-----------------------------------------------------------------------------------------

Attaching package: ‘plyr’

The following object is masked from ‘package:purrr’:

    compact

The following objects are masked from ‘package:dplyr’:

    arrange, count, desc, failwith, id, mutate, rename, summarise, summarize
#library(agricolae)
library(jtools)
library(xlsx)
RawData <- list.files(full.names = TRUE) %>% 
  lapply(read_csv) %>% 
  bind_rows
Warning: One or more parsing issues, see `problems()` for detailsRows: 691 Columns: 1── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): ---
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 252 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 200 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 155 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 206 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 254 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 90 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 164 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 142 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 266 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 180 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 228 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 140 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 196 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 152 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 213 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 113 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 329 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 217 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 207 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 175 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 110 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 146 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 180 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 206 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 202 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 217 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 111 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 102 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 173 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 156 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 164 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 149 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 195 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 103 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 187 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 301 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 301 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 271 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 171 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 163 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 229 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 230 Columns: 10── Column specification ───────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 271 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 318 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 320 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 271 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 302 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 110 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 201 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 141 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 204 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 88 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 215 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 233 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 130 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 135 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 201 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 228 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 351 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 248 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 195 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 129 Columns: 10── Column specification ────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 246 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 124 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 246 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 412 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 127 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 282 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 227 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 209 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 96 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 227 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 71 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 98 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 131 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 257 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 94 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 450 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 219 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 223 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 229 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 145 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 129 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 315 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 232 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 188 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 135 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 121 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 206 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 143 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 91 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 106 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 102 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 143 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 131 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 169 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 115 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 104 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 107 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 158 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 111 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 125 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 290 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 202 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 163 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 121 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 92 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 73 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 168 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 228 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 200 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 106 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 117 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 135 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.New names:Rows: 154 Columns: 10── Column specification ──────────────────────────────────────────────────────────────────────────────────────────────────────
Delimiter: ","
chr (1): Label
dbl (9): ...1, Area, Mean, StdDev, IntDen, Median, RawIntDen, MinThr, MaxThr
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#Reading multiple csv files into the same dataframe, with only one Head # when the file only contains the data and the R-notebook

Rdata <- RawData %>% drop_na(Label) #dropping NA rows
Rdata <- Rdata[,-c(1)] #deleting extra non-informative columns that appears while reading the csv file to the dataframe
Data2Handle <- Rdata%>%
  separate(Label,c("Treatment","CellNumber"),sep = "_") #%>% #seperating the label column into 2 diffrenet columns labeled as treatment and cell number- these two variables identify diffrernt cells within the same treatment, both are needed in the next steps of analysis

Data2Handle <- select(Data2Handle,-c(MinThr,MaxThr)) #deleting another 2 non-informative columns
Data2Handle[Data2Handle$Treatment=='ShA1/2+ShB1/2+s97',]$Treatment <- c('2ShA+2ShB+s97')
Data2Handle[Data2Handle$Treatment=='1ShA+1ShB+s97',]$Treatment <- c('2ShA+2ShB+s97')
Data2Handle[Data2Handle$Treatment=='sha34+shb14+s97',]$Treatment <- c('3ShA+1ShB+s97')
Data2Handle[Data2Handle$Treatment=='shb3+sha1+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA4+ShB0+s97',]$Treatment <- c('4ShA+0ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA0+ShB4+s97',]$Treatment <- c('0ShA+4ShB+s97')
Data2Handle[Data2Handle$Treatment=='shb34+sha14+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA3+ShB1+s97',]$Treatment <- c('3ShA+1ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA1+ShB3+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA2+ShB2+s97',]$Treatment <- c('2ShA+2ShB+s97')


CellsDataC <- Data2Handle[Data2Handle$Area>100,] # subsetting the data for the whole cell into a new dataframe, in each csv file there is one row for the whole cell- these data contains information about cells 
Data2Handle <- Data2Handle[Data2Handle$Area<100 ,] #eliminating the rows of data regarding the whole cells">300" and also the rows where the clusters area smaller than the resolution of the microscope
CellsData <-  Data2Handle %>%
  group_by(Treatment) %>%
  group_by(CellNumber,.add = TRUE) %>%
  dplyr::summarise(ClusterCount = n(),SumofRawIntDen=sum(RawIntDen),.groups = "keep") #grouping the cells data by treatment and cell number-both variables needed to identify unique cells from each other as written before- and then counting for each cell the cluster number(aka the rows in each file which belong to the specific cell)
CellsDataC <- merge(CellsDataC,CellsData, by=c("Treatment","CellNumber"))
#grouping the cells data by treatment and cell number-both variables needed to identify unique cells from each other as written before- and then counting for each cell the cluster number(aka the rows in each file which belong to the specific cell)
CellsDataC <- CellsDataC %>%
  mutate(ClustersNumberPerCell=ClusterCount/Area,
         non_normlizedClusterNumber=ClusterCount,
         CellSurfaceExpression=RawIntDen/Area,
         FractionofChannelsinClusters=SumofRawIntDen/RawIntDen)
CellsDataC <- CellsDataC[,-9]
#CellsDataC$ClustersNumberPerCell <- CellsData$ClusterCount/CellsDataC$Area
#CellsDataC$non_normlizedClusterNumber<- CellsData$ClusterCount#adding the vairable ClusterCount to the CellsDataC
#CellsDataC$CellSurfaceExpression <- CellsDataC$RawIntDen/CellsDataC$Area #adding the vairable CellSurfaceExpression to the CellsDataC- defined as the clusternumbers per cell normalized to the whole cell area (since a higher area of membrane will contain as well a higher number of clusters)  
#CellsDataC$FractionofChannelsinClusters=CellsData$SumofRawIntDen/CellsDataC$RawIntDen
#CellsDataC$SumofRawIntDen <- CellsData$SumofRawIntDen
#CellsDataC #showing the data to have a general look


Data2Handle <-Data2Handle %>%
  mutate(Density= RawIntDen/Area) # adding a new variable calculated by the area and RawIntDen variables in the data for each cluster in the cells, defined as Density
Data2Handle<- Data2Handle[,-9]#showing the data to have a general look
Data2Handle
CellsDataC

CellsData_sum <-  Data2Handle %>%
  group_by(Treatment) %>%
  group_by(CellNumber,.add = TRUE) %>%
  dplyr::summarise(ClusterCount = n(),MeanDen = mean(Density),MeanArea = mean(Area),Standardddensity= sd(Density),StandarddArea= sd(Area),.groups = "keep")
write.csv(CellsData_sum,"CellsData_sum.csv", row.names = TRUE)

write.xlsx(CellsDataC,"CellsDataC_sum.xlsx", row.names = TRUE)

User must give as an input the number of variables he wants to be analyzed and their exact names as they show up in the Data: Data2Handle or CellsDataC. the input in the variable VariableToAnalyze will be used in next chunks to decide whether to run a specific commands or not-that is to run only the parts for the variables shows in VariableToAnalyze as the user decide However, rigth now this chunk is deactivated- and the value of VariableToAnalyze has been assigned as all the variables to be analyzed .

numberofvar <- as.integer(readline(prompt=paste("How many you want to analyze:"))) #asking the user to decide how many variables he want to analyze 
4
numvec <- c(1:numberofvar) # creating vector of number according to the varaible number input in the previous command
inputvec <- function(num=0)#function to take multiple variable names
  #argument: the number of the variable to take as an input, to show to it to the user
  #value: the exact name of the variable to analyse -as it showes in the dataframe
{
  vec <- readline(prompt=paste("what is your", num, "variable exact name "))
  return(vec)
}
VariableToAnalyze <- sapply(numvec, inputvec)# vectorized command to take as an input the exact names of the variables to be analyzed
Density
Area
ClustersNumberPerCell
CellSurfaceExpression

The next chuck will process the Data for the specific variables given in VariableToAnalyze: the function will perform Kolmogorov-Smirnov Normality test and Levene’s test for homoscedasticity. The result for the variables will be assigned in a new dataframe with the proper column names. Note: True in the new dataframe-indicate4transformationAll stands for p-value<0.05 (aka the assumption is violated)

DeterminePropTest <- function(VarName,Data)
  #description of the function: the function accept as an argumnt the variable name as it displayed in the data and check homogeneity of variance across groups and normality of residuals
  #argument-the name of variable to check and the data
  #value- vector of logical values (true stands for assumption violation) the 1st-homogeneity-levene's test, the 2nd- Kolmogorov-Smirnov Tests
{
names(Data)[names(Data) == VarName] <- "VarName" #in order to perform the test on a specific variable, changing the name of this variable to VarName- this make it easier to generalize the code, thus it could handle all the variable names as will be seen in next commands
  Model  = lm(VarName ~ Treatment,
              data=Data) #creating the model for Anova- defining the dependent and independent variables 
Anova(Model)   
summary(Model)
#all these parameters sets the model for further analysis
vec <- as.integer(readline(prompt=paste("Do you want to see the plots (Residuals Histogram and Residuals vs. predicted values) of the variable",VarName,"Yes-1, No-0 "))) #according to the user choice-shows or not-the plots to assess the data by himself in addition to the test carried out in the function
if (vec==1) 
  {hist(residuals(Model),col="darkgray")
#plot(fitted(Model),residuals(Model)) #plotting the fitted data to see The distribution of these residuals as it should be approximately normal.
qqnorm(Model$residuals)  # Another method of looking at the residuals normality
qqline(Model$residuals)# setting the normal line, in order to assist the residuals normality
pvaluelevene <- leveneTest(VarName ~ Treatment,data=Data,center=mean)[1,3] #assigning the p-value of levens test into a new variable
pvalueks <- ks.test(Model$residuals,pnorm,mean(Model$residuals),sd(Model$residuals))[[2]] #assigning the p-value of ks into a new variable
return (c(pvaluelevene<0.01,pvalueks<0.01)) #returning both values
}else
{pvaluelevene <- leveneTest(VarName ~ Treatment,data=Data,center=mean)[1,3]#assigning the p-value of levens test into a new variable
pvalueks <- ks.test(Model$residuals,pnorm,mean(Model$residuals),sd(Model$residuals))[[2]]#assigning the p-value of ks into a new variable
return (c(pvaluelevene<0.01,pvalueks<0.01))} #returning both values
}
#result4Transofrmation <- DeterminePropTest("ClustersNumberPerCell", CellsDataC)
#VariableToAnalyze <- c('ClustersNumberPerCell','Density','Area','CellSurfaceExpression')
#x <- VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell']
indicate4transformationW <- t(sapply(VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell'|VariableToAnalyze=='CellSurfaceExpression'|VariableToAnalyze=='FractionofChannelsinClusters'], DeterminePropTest, CellsDataC)) #subsetting the variables name belongging to the dataframe CellsDataC, on these variables, the function is called
1

Warning: group coerced to factor.
1

Warning: group coerced to factor.
1

Warning: group coerced to factor.

indicate4transformationC <- t(sapply(VariableToAnalyze[VariableToAnalyze=='Area'|VariableToAnalyze=='Density'], DeterminePropTest, Data2Handle)) #subsetting the variables name belongging to the dataframe Data2Handle, on these variables, the function is called
1

Warning: group coerced to factor.Warning: default ks.test() cannot compute correct p-values with ties;
 see help page for one-sample Kolmogorov test for discrete distributions.
1

Warning: group coerced to factor.Warning: default ks.test() cannot compute correct p-values with ties;
 see help page for one-sample Kolmogorov test for discrete distributions.

# a result of True for both columns stands for a need of transformation in the variable- shows in name of the row 
if (length(indicate4transformationC)!=0){ # this condition is to handle a situation where is no variables given by the user in the second chunk
  colnames(indicate4transformationC) <- c("Levene's test", "Kolmogorov-Smirnov Tests") #setting the column names
  indicate4transformationC #showingg the results
  indicate4transformationC <- as.data.frame(indicate4transformationC) %>%
  add_column(VarN=VariableToAnalyze[VariableToAnalyze=='Area'|VariableToAnalyze=='Density'],.before=1) #adding a new column with the variable names
row.names(indicate4transformationC) <- NULL #resetting the the rownames
indicate4transformationC
}
 if (length(indicate4transformationW)!=0){ # this condition is to handle a sitution where is no variables given by the user in the second chunk 
  colnames(indicate4transformationW) <- c("Levene's test", "Kolmogorov-Smirnov Tests")#setting the column names
  indicate4transformationW #showingg the results
indicate4transformationW <- as.data.frame(indicate4transformationW) %>%
  add_column(VarN=VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell'|VariableToAnalyze=='CellSurfaceExpression'|VariableToAnalyze=='FractionofChannelsinClusters'],.before=1) #adding a new column with the variable names
row.names(indicate4transformationW) <- NULL #resetting the the rownames
indicate4transformationW
}
indicate4transformationAll <- rbind.data.frame(indicate4transformationC,indicate4transformationW) #bindingg the dataframes togather
indicate4transformationAll

This chunk is responsible for calculating the transformation when ever needed. the transformation type is determined by an Manual calibration of my own data behavior. also here the transformation is carried out only when the data met the conditions to perform one (when both assumptions: normality and homoscedasticity, are violated) Note: I used the “->>” whitch makes the data global, This allowed me to change and update directly both dataframes whenever i needed(CellsDataC/Data2Handle)

trans.log <- function(x){return(log2(x))} #mathmatical function for transformation calculation
trans.sqrt <- function(x){return(sqrt(x+0.5))} #mathmatical function for transformation calculation
trans.receip <- function(x){return(1/(x))} #mathmatical function for transformation calculation
#old <- Data2Handle
#Data2Handle <- backup
TransUpdate <- function(VarRow)
  #function description: the function accept as arggument/input a row of dataframe, containeing 3 columns. the first row is the variable needed to tranform the second and the third are the result of levenes and ks test - True stands for a p-value lower than 0.05 (both true stands for a need of transformation of the data for this specific variable)
  #outpot is a dataframe updated with a column of transformated data of variable
{
  #VarRow <- indicate4transformation[2,]
  Data2Handle<<-Data2Handle # changing the dataframe of each cluster to be global/ also shows in the function itself
  CellsDataC<<-CellsDataC# changing the dataframe of the whole cells to be global/ also shows in the function itself
  if (VarRow[1] %in% c("Density","Area")) #checking if the variable belong to the data of clusters
  {
    if (VarRow[1]=="Density"){ #if the variable in the argument is density, then calculating the appropriate type of transformation for the density
      Data2Handle$Density.trans<<- sapply(Data2Handle$Density,trans.sqrt)
    } else{ #if the variable in the argument is Area, then calculating the appropriate type of transformation for the area
      Data2Handle$Area.trans<<- sapply(Data2Handle$Area,trans.log)}
  }else #otherwise, the variable belong to the whole cell data
  {
    if (VarRow[1]=='CellSurfaceExpression'){CellsDataC$CellSurfaceExpression.trans <<- sapply(CellsDataC$CellSurfaceExpression,trans.sqrt)} #if the variable in the argument is CellSurfaceExpression, then calculating the appropriate type of transformation for the CellSurfaceExpression
      else{CellsDataC$ClustersNumberPerCell.trans <<- sapply(CellsDataC$ClustersNumberPerCell,trans.receip)} #if the variable in the argument is ClustersNumberPerCell, then calculating the appropriate type of transformation for the ClustersNumberPerCell
  }
  
  CellsDataC<<-CellsDataC
  Data2Handle<<-Data2Handle 
}
if(!is.null(indicate4transformationAll)){
apply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T,],MARGIN=1,TransUpdate)} #applying the function on the data frame
$`1`

$`2`
#please pay attention, the apply is suitable for updating global variables
Data2Handle
CellsDataC
NA

This chunk contain all the possible statistics test suitable for a numeric dependent variable and a independent variable with more then two levels. This is needed since i have partial data for now, i don’t know whether my data will violate both assumption in the future or not, so just in case. The tests will be performed according to indicate4transformationAll results. When the conditions aren’t met, A Null dataframe will be sent to the test so no result will be shown. As it could be seen my partial data will need ANOVA (after transformation/ with no transformation)


welchtest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of welch test of the variable
{
  names(Data)[names(Data) == varname] <- "varname" #changing the column name-in the argumnets- to varname, this helps generlaizing the code, so it would run in each variable name
  oneway.test(varname ~ Treatment,
            data=Data,
            var.equal=FALSE) #performing the test
}
print("Welch test result if the conditions are met is below")
[1] "Welch test result if the conditions are met is below"
CWelch <- sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F & (indicate4transformationAll$VarN=='Area'|indicate4transformationAll$VarN=='Density'),1], welchtest, Data2Handle)
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F & (indicate4transformationAll$VarN=='CellSurfaceExpression'|indicate4transformationAll$VarN=='ClustersNumberPerCell'),1], welchtest, CellsDataC)
          CellSurfaceExpression                                     
statistic 2.929638                                                  
parameter numeric,2                                                 
p.value   0.02891893                                                
method    "One-way analysis of means (not assuming equal variances)"
data.name "varname and Treatment"                                   
#both commands send the the two diffrenet Dataframe i have to the function for welch-test with the relavant variable name, only when the result of levens and Kolmogorov-Smirno is T,F respectivly - that is heteroscedasticity is found in the diffrent groups. this is done by sending the appropriate dataframe with the variable to test.
####################

KWtest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of Kruskal-walis test of the variable
{
  names(Data)[names(Data) == varname] <- "varname" #changing the column name-in the argumnets- to varname, this helps generlaizing the code, so it would run in each variable name
  kruskal.test(varname ~ Treatment,
                data = Data) #performing the test
}
print("Kruskal-Wallis test result if the conditions are met is below")
[1] "Kruskal-Wallis test result if the conditions are met is below"
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T & (indicate4transformationAll$VarN=='Area'|indicate4transformationAll$VarN=='Density'),1], KWtest, Data2Handle)
named list()
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T & (indicate4transformationAll$VarN=='CellSurfaceExpression'|indicate4transformationAll$VarN=='ClustersNumberPerCell'),1], KWtest, CellsDataC)
named list()
#both commands send the the two diffrenet Dataframe i have to the function for kruskal-wallis, only when the result of levens and Kolmogorov-Smirno is F,T respectivly - that is Normality assumption is violated. this is done each time by sending the appropriate dataframe with the variable to test.
#############


Anovatest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of Anova test of the variable
{
  names(Data)[names(Data) == varname] <- "varname"
  Model  = lm(varname ~ Treatment,
              data=Data) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis and Treatment respectively)
c <- c(unlist(anova(Model)[1,4:5]) )#
return(c)
}
#############

print("ANOVA test result if the conditions are met is below")
[1] "ANOVA test result if the conditions are met is below"
new4anova <- indicate4transformationAll[(indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F) |(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),] #subsetiing the variables suitable for ANOVA test (transformed data and data which is Normal and has homogeneity of variances), from the result dataframe of levens and Kolmogorov-Smirnov.

new4anova[(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),]$VarN <- sapply(new4anova[(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),]$VarN,paste,sep=".","trans")
#updating the results dataframe first column when the variable in the row is transformated.

AnovaresC <- sapply(new4anova[(new4anova$VarN=="Density.trans" | new4anova$VarN=="Area.trans" |new4anova$VarN=="Density" | new4anova$VarN=="Area"),1],Anovatest, Data2Handle)
#appling ANOVA on the variables suited with the test in the Dataframe of the clusters
AnovaresW <- sapply(new4anova[(new4anova$VarN=="CellSurfaceExpression.trans" | new4anova$VarN=="ClustersNumberPerCell.trans"|new4anova$VarN=="CellSurfaceExpression" | new4anova$VarN=="ClustersNumberPerCell"),1],Anovatest, CellsDataC)
#appling ANOVA on the variables suited with the test in the Dataframe of whole cells

AnovaresC
          Area.trans Density.trans
F value 6.654356e+00  3.506711e+02
Pr(>F)  2.392309e-05 7.557020e-293
AnovaresW
        ClustersNumberPerCell
F value            2.19660787
Pr(>F)             0.07400829

this

#check differences in FractionofChannelsinClusters
AnovaresW_3 <- sapply("FractionofChannelsinClusters",Anovatest, CellsDataC)
AnovaresW_3
        FractionofChannelsinClusters
F value                 1.713494e+01
Pr(>F)                  5.968180e-11
#check 
#ModelFraction  = lm(FractionofChannelsinClusters ~ Treatment,
#              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis and Treatment respectively)
#summary(ModelFraction)

In this chuck a Tuckey pairwise multiple comparison (post-hoc) is done for all the variables.

library(agricolae)
#in all those commands a tukey multiple comparsion test is carried out
if ("Density" %in% VariableToAnalyze){
model  = lm(Density.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Density.trans and Treatment respectively)
PWDensity <- (HSD.test(model, "Treatment", group = F)) #tukey test
PWDensity}
$statistics
   MSerror    Df     Mean       CV
  4470.641 21210 430.2129 15.54181

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.857987  0.05

$means
              Density.trans      std    r      Min      Max      Q25      Q50      Q75
0ShA+4ShB+s97      400.4135 68.73403 3291 244.1150 665.8430 348.1639 394.4883 445.6004
1ShA+3ShB+s97      418.2675 63.04118 4318 234.1932 717.2235 374.9122 414.7732 459.1624
2ShA+2ShB+s97      428.5679 66.76477 4497 251.5226 694.1119 379.4243 427.1810 476.1074
3ShA+1ShB+s97      448.5453 70.70435 4884 258.2291 811.9168 399.4243 449.2066 496.3832
4ShA+0ShB+s97      446.1919 64.67950 4225 274.7080 770.7598 401.2400 442.2648 487.7584

$comparison
                              difference pvalue signif.        LCL        UCL
0ShA+4ShB+s97 - 1ShA+3ShB+s97 -17.853969 0.0000     *** -22.074715 -13.633224
0ShA+4ShB+s97 - 2ShA+2ShB+s97 -28.154371 0.0000     *** -32.338627 -23.970116
0ShA+4ShB+s97 - 3ShA+1ShB+s97 -48.131777 0.0000     *** -52.245383 -44.018171
0ShA+4ShB+s97 - 4ShA+0ShB+s97 -45.778389 0.0000     *** -50.019179 -41.537600
1ShA+3ShB+s97 - 2ShA+2ShB+s97 -10.300402 0.0000     *** -14.186726  -6.414078
1ShA+3ShB+s97 - 3ShA+1ShB+s97 -30.277808 0.0000     *** -34.087962 -26.467653
1ShA+3ShB+s97 - 4ShA+0ShB+s97 -27.924420 0.0000     *** -31.871547 -23.977292
2ShA+2ShB+s97 - 3ShA+1ShB+s97 -19.977405 0.0000     *** -23.747098 -16.207713
2ShA+2ShB+s97 - 4ShA+0ShB+s97 -17.624018 0.0000     *** -21.532101 -13.715934
3ShA+1ShB+s97 - 4ShA+0ShB+s97   2.353388 0.4494          -1.478959   6.185734

$groups
NULL

attr(,"class")
[1] "group"
if ("Area" %in% VariableToAnalyze){
Model  = lm(Area.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Area.trans and Treatment respectively)
PWArea <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWArea}
$statistics
   MSerror    Df      Mean        CV
  1.026261 21210 -3.046036 -33.25784

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.857987  0.05

$means
              Area.trans       std    r       Min         Max       Q25       Q50       Q75
0ShA+4ShB+s97  -3.095240 1.0055057 3291 -5.025195 -0.15848089 -3.827277 -3.068278 -2.331314
1ShA+3ShB+s97  -3.083592 1.0310570 4318 -5.025195  0.13049734 -3.864744 -3.090311 -2.331314
2ShA+2ShB+s97  -2.993781 1.0266827 4497 -5.025195 -0.03319156 -3.755123 -2.942751 -2.254696
3ShA+1ShB+s97  -3.039447 0.9862198 4884 -5.025195 -0.28612826 -3.790739 -3.004149 -2.292496
4ShA+0ShB+s97  -3.032560 1.0163072 4225 -5.025195  0.32611506 -3.790739 -3.004149 -2.292496

$comparison
                                difference pvalue signif.         LCL          UCL
0ShA+4ShB+s97 - 1ShA+3ShB+s97 -0.011647596 0.9877         -0.07559652  0.052301332
0ShA+4ShB+s97 - 2ShA+2ShB+s97 -0.101458328 0.0001     *** -0.16485440 -0.038062260
0ShA+4ShB+s97 - 3ShA+1ShB+s97 -0.055792579 0.1043         -0.11811823  0.006533076
0ShA+4ShB+s97 - 4ShA+0ShB+s97 -0.062679487 0.0599       . -0.12693210  0.001573130
1ShA+3ShB+s97 - 2ShA+2ShB+s97 -0.089810732 0.0003     *** -0.14869281 -0.030928654
1ShA+3ShB+s97 - 3ShA+1ShB+s97 -0.044144983 0.2260         -0.10187301  0.013583043
1ShA+3ShB+s97 - 4ShA+0ShB+s97 -0.051031891 0.1361         -0.11083521  0.008771424
2ShA+2ShB+s97 - 3ShA+1ShB+s97  0.045665749 0.1868         -0.01144923  0.102780730
2ShA+2ShB+s97 - 4ShA+0ShB+s97  0.038778841 0.3814         -0.02043292  0.097990602
3ShA+1ShB+s97 - 4ShA+0ShB+s97 -0.006886909 0.9976         -0.06495117  0.051177352

$groups
NULL

attr(,"class")
[1] "group"
if ("ClustersNumberPerCell" %in% VariableToAnalyze){
Model  = lm(ClustersNumberPerCell ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-ClustersNumberPerCell and Treatment respectively)
PWClustersNumberPerCell <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWClustersNumberPerCell}
$statistics
      MSerror  Df      Mean       CV
  0.008271292 110 0.2599713 34.98334

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              ClustersNumberPerCell        std  r        Min       Max       Q25       Q50       Q75
0ShA+4ShB+s97             0.2247688 0.05837489 23 0.13479727 0.3526948 0.1844342 0.2214116 0.2495249
1ShA+3ShB+s97             0.2433054 0.08573203 24 0.10937268 0.4316930 0.1851996 0.2285649 0.3103415
2ShA+2ShB+s97             0.2595909 0.08390007 24 0.14180561 0.4120641 0.1776014 0.2629209 0.3253100
3ShA+1ShB+s97             0.2964855 0.11610224 23 0.11165832 0.6063847 0.2008870 0.2992935 0.3734330
4ShA+0ShB+s97             0.2780164 0.10182547 21 0.09983524 0.4733694 0.2005765 0.2889477 0.3405698

$comparison
                               difference pvalue signif.         LCL         UCL
0ShA+4ShB+s97 - 1ShA+3ShB+s97 -0.01853658 0.9564         -0.09214058 0.055067413
0ShA+4ShB+s97 - 2ShA+2ShB+s97 -0.03482209 0.6843         -0.10842608 0.038781909
0ShA+4ShB+s97 - 3ShA+1ShB+s97 -0.07171665 0.0644       . -0.14609955 0.002666243
0ShA+4ShB+s97 - 4ShA+0ShB+s97 -0.05324756 0.3027         -0.12938088 0.022885757
1ShA+3ShB+s97 - 2ShA+2ShB+s97 -0.01628550 0.9716         -0.08910227 0.056531260
1ShA+3ShB+s97 - 3ShA+1ShB+s97 -0.05318007 0.2710         -0.12678407 0.020423925
1ShA+3ShB+s97 - 4ShA+0ShB+s97 -0.03471098 0.7056         -0.11008349 0.040661530
2ShA+2ShB+s97 - 3ShA+1ShB+s97 -0.03689457 0.6352         -0.11049856 0.036709430
2ShA+2ShB+s97 - 4ShA+0ShB+s97 -0.01842548 0.9608         -0.09379799 0.056947035
3ShA+1ShB+s97 - 4ShA+0ShB+s97  0.01846909 0.9619         -0.05766423 0.094602409

$groups
NULL

attr(,"class")
[1] "group"
if ("CellSurfaceExpression" %in% VariableToAnalyze){
Model  = lm(CellSurfaceExpression ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-CellSurfaceExpression and Treatment respectively)
PWCellSurfaceExpression <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWCellSurfaceExpression}
$statistics
    MSerror  Df     Mean       CV
  256441691 110 49133.07 32.59271

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              CellSurfaceExpression      std  r      Min       Max      Q25      Q50      Q75
0ShA+4ShB+s97              51590.10 10076.60 23 37444.09  77562.41 46943.87 50556.57 54562.10
1ShA+3ShB+s97              49019.55 11285.33 24 28552.33  81037.94 44728.19 47523.83 52665.58
2ShA+2ShB+s97              44041.60 11102.78 24 20427.47  69952.79 37584.27 42221.34 49288.98
3ShA+1ShB+s97              57416.28 28273.77 23 15125.78 121335.24 38004.18 58794.03 72606.45
4ShA+0ShB+s97              43318.50 11452.80 21 23119.77  64618.78 38922.08 44239.60 50289.20

$comparison
                               difference pvalue signif.         LCL        UCL
0ShA+4ShB+s97 - 1ShA+3ShB+s97   2570.5499 0.9817         -10389.5707 15530.6704
0ShA+4ShB+s97 - 2ShA+2ShB+s97   7548.5027 0.4908          -5411.6178 20508.6233
0ShA+4ShB+s97 - 3ShA+1ShB+s97  -5826.1727 0.7317         -18923.4412  7271.0958
0ShA+4ShB+s97 - 4ShA+0ShB+s97   8271.6046 0.4313          -5133.8771 21677.0862
1ShA+3ShB+s97 - 2ShA+2ShB+s97   4977.9529 0.8181          -7843.5528 17799.4585
1ShA+3ShB+s97 - 3ShA+1ShB+s97  -8396.7226 0.3806         -21356.8431  4563.3980
1ShA+3ShB+s97 - 4ShA+0ShB+s97   5701.0547 0.7563          -7570.4645 18972.5739
2ShA+2ShB+s97 - 3ShA+1ShB+s97 -13374.6754 0.0396       * -26334.7960  -414.5549
2ShA+2ShB+s97 - 4ShA+0ShB+s97    723.1019 0.9999         -12548.4173 13994.6210
3ShA+1ShB+s97 - 4ShA+0ShB+s97  14097.7773 0.0341       *    692.2956 27503.2590

$groups
NULL

attr(,"class")
[1] "group"
new_model=lm(FractionofChannelsinClusters ~ Treatment,
              data=CellsDataC)
PWFractionofChannelsinClusters <- HSD.test(new_model, "Treatment", group = F)
PWFractionofChannelsinClusters
$statistics
      MSerror  Df      Mean       CV
  0.002155109 110 0.1723334 26.93799

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              FractionofChannelsinClusters        std  r        Min       Max       Q25       Q50       Q75
0ShA+4ShB+s97                    0.1184674 0.03415173 23 0.06452483 0.1952345 0.1017302 0.1145079 0.1275438
1ShA+3ShB+s97                    0.1441908 0.04804622 24 0.04769675 0.2854157 0.1127500 0.1455464 0.1671480
2ShA+2ShB+s97                    0.1963731 0.04404672 24 0.13103603 0.2817908 0.1611118 0.1946739 0.2279166
3ShA+1ShB+s97                    0.1902971 0.03848746 23 0.09909048 0.2634846 0.1659696 0.1936079 0.2142701
4ShA+0ShB+s97                    0.2163439 0.06367791 21 0.13665866 0.3523840 0.1739099 0.2049573 0.2194671

$comparison
                                difference pvalue signif.         LCL          UCL
0ShA+4ShB+s97 - 1ShA+3ShB+s97 -0.025723403 0.3240         -0.06329411  0.011847301
0ShA+4ShB+s97 - 2ShA+2ShB+s97 -0.077905661 0.0000     *** -0.11547636 -0.040334957
0ShA+4ShB+s97 - 3ShA+1ShB+s97 -0.071829663 0.0000     *** -0.10979795 -0.033861375
0ShA+4ShB+s97 - 4ShA+0ShB+s97 -0.097876438 0.0000     *** -0.13673822 -0.059014656
1ShA+3ShB+s97 - 2ShA+2ShB+s97 -0.052182258 0.0016      ** -0.08935112 -0.015013391
1ShA+3ShB+s97 - 3ShA+1ShB+s97 -0.046106260 0.0081      ** -0.08367696 -0.008535556
1ShA+3ShB+s97 - 4ShA+0ShB+s97 -0.072153035 0.0000     *** -0.11062647 -0.033679603
2ShA+2ShB+s97 - 3ShA+1ShB+s97  0.006075998 0.9915         -0.03149471  0.043646702
2ShA+2ShB+s97 - 4ShA+0ShB+s97 -0.019970778 0.6035         -0.05844421  0.018502655
3ShA+1ShB+s97 - 4ShA+0ShB+s97 -0.026046775 0.3457         -0.06490856  0.012815007

$groups
NULL

attr(,"class")
[1] "group"

new try

library(agricolae)
#in all those commands a tukey multiple comparsion test is carried out
if ("Density" %in% VariableToAnalyze){
model  = lm(Density.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Density.trans and Treatment respectively)
PWDensity <- (HSD.test(model, "Treatment", group = T)) #tukey test
PWDensity}
$statistics
   MSerror    Df     Mean       CV
  4470.641 21210 430.2129 15.54181

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.857987  0.05

$means
              Density.trans      std    r      Min      Max      Q25      Q50      Q75
0ShA+4ShB+s97      400.4135 68.73403 3291 244.1150 665.8430 348.1639 394.4883 445.6004
1ShA+3ShB+s97      418.2675 63.04118 4318 234.1932 717.2235 374.9122 414.7732 459.1624
2ShA+2ShB+s97      428.5679 66.76477 4497 251.5226 694.1119 379.4243 427.1810 476.1074
3ShA+1ShB+s97      448.5453 70.70435 4884 258.2291 811.9168 399.4243 449.2066 496.3832
4ShA+0ShB+s97      446.1919 64.67950 4225 274.7080 770.7598 401.2400 442.2648 487.7584

$comparison
NULL

$groups
              Density.trans groups
3ShA+1ShB+s97      448.5453      a
4ShA+0ShB+s97      446.1919      a
2ShA+2ShB+s97      428.5679      b
1ShA+3ShB+s97      418.2675      c
0ShA+4ShB+s97      400.4135      d

attr(,"class")
[1] "group"
if ("Area" %in% VariableToAnalyze){
Model  = lm(Area.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Area.trans and Treatment respectively)
PWArea <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWArea}
$statistics
   MSerror    Df      Mean        CV
  1.026261 21210 -3.046036 -33.25784

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.857987  0.05

$means
              Area.trans       std    r       Min         Max       Q25       Q50       Q75
0ShA+4ShB+s97  -3.095240 1.0055057 3291 -5.025195 -0.15848089 -3.827277 -3.068278 -2.331314
1ShA+3ShB+s97  -3.083592 1.0310570 4318 -5.025195  0.13049734 -3.864744 -3.090311 -2.331314
2ShA+2ShB+s97  -2.993781 1.0266827 4497 -5.025195 -0.03319156 -3.755123 -2.942751 -2.254696
3ShA+1ShB+s97  -3.039447 0.9862198 4884 -5.025195 -0.28612826 -3.790739 -3.004149 -2.292496
4ShA+0ShB+s97  -3.032560 1.0163072 4225 -5.025195  0.32611506 -3.790739 -3.004149 -2.292496

$comparison
NULL

$groups
              Area.trans groups
2ShA+2ShB+s97  -2.993781      a
4ShA+0ShB+s97  -3.032560     ab
3ShA+1ShB+s97  -3.039447     ab
1ShA+3ShB+s97  -3.083592      b
0ShA+4ShB+s97  -3.095240      b

attr(,"class")
[1] "group"
if ("ClustersNumberPerCell" %in% VariableToAnalyze){
Model  = lm(ClustersNumberPerCell ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-ClustersNumberPerCell and Treatment respectively)
PWClustersNumberPerCell <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWClustersNumberPerCell}
$statistics
      MSerror  Df      Mean       CV
  0.008271292 110 0.2599713 34.98334

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              ClustersNumberPerCell        std  r        Min       Max       Q25       Q50       Q75
0ShA+4ShB+s97             0.2247688 0.05837489 23 0.13479727 0.3526948 0.1844342 0.2214116 0.2495249
1ShA+3ShB+s97             0.2433054 0.08573203 24 0.10937268 0.4316930 0.1851996 0.2285649 0.3103415
2ShA+2ShB+s97             0.2595909 0.08390007 24 0.14180561 0.4120641 0.1776014 0.2629209 0.3253100
3ShA+1ShB+s97             0.2964855 0.11610224 23 0.11165832 0.6063847 0.2008870 0.2992935 0.3734330
4ShA+0ShB+s97             0.2780164 0.10182547 21 0.09983524 0.4733694 0.2005765 0.2889477 0.3405698

$comparison
NULL

$groups
              ClustersNumberPerCell groups
3ShA+1ShB+s97             0.2964855      a
4ShA+0ShB+s97             0.2780164      a
2ShA+2ShB+s97             0.2595909      a
1ShA+3ShB+s97             0.2433054      a
0ShA+4ShB+s97             0.2247688      a

attr(,"class")
[1] "group"
if ("CellSurfaceExpression" %in% VariableToAnalyze){
Model  = lm(CellSurfaceExpression ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-CellSurfaceExpression and Treatment respectively)
PWCellSurfaceExpression <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWCellSurfaceExpression}
$statistics
    MSerror  Df     Mean       CV
  256441691 110 49133.07 32.59271

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              CellSurfaceExpression      std  r      Min       Max      Q25      Q50      Q75
0ShA+4ShB+s97              51590.10 10076.60 23 37444.09  77562.41 46943.87 50556.57 54562.10
1ShA+3ShB+s97              49019.55 11285.33 24 28552.33  81037.94 44728.19 47523.83 52665.58
2ShA+2ShB+s97              44041.60 11102.78 24 20427.47  69952.79 37584.27 42221.34 49288.98
3ShA+1ShB+s97              57416.28 28273.77 23 15125.78 121335.24 38004.18 58794.03 72606.45
4ShA+0ShB+s97              43318.50 11452.80 21 23119.77  64618.78 38922.08 44239.60 50289.20

$comparison
NULL

$groups
              CellSurfaceExpression groups
3ShA+1ShB+s97              57416.28      a
0ShA+4ShB+s97              51590.10     ab
1ShA+3ShB+s97              49019.55     ab
2ShA+2ShB+s97              44041.60      b
4ShA+0ShB+s97              43318.50      b

attr(,"class")
[1] "group"
new_model=lm(FractionofChannelsinClusters ~ Treatment,
              data=CellsDataC)
PWFractionofChannelsinClusters <- HSD.test(new_model, "Treatment", group = T)
PWFractionofChannelsinClusters
$statistics
      MSerror  Df      Mean       CV
  0.002155109 110 0.1723334 26.93799

$parameters
   test    name.t ntr StudentizedRange alpha
  Tukey Treatment   5         3.922386  0.05

$means
              FractionofChannelsinClusters        std  r        Min       Max       Q25       Q50       Q75
0ShA+4ShB+s97                    0.1184674 0.03415173 23 0.06452483 0.1952345 0.1017302 0.1145079 0.1275438
1ShA+3ShB+s97                    0.1441908 0.04804622 24 0.04769675 0.2854157 0.1127500 0.1455464 0.1671480
2ShA+2ShB+s97                    0.1963731 0.04404672 24 0.13103603 0.2817908 0.1611118 0.1946739 0.2279166
3ShA+1ShB+s97                    0.1902971 0.03848746 23 0.09909048 0.2634846 0.1659696 0.1936079 0.2142701
4ShA+0ShB+s97                    0.2163439 0.06367791 21 0.13665866 0.3523840 0.1739099 0.2049573 0.2194671

$comparison
NULL

$groups
              FractionofChannelsinClusters groups
4ShA+0ShB+s97                    0.2163439      a
2ShA+2ShB+s97                    0.1963731      a
3ShA+1ShB+s97                    0.1902971      a
1ShA+3ShB+s97                    0.1441908      b
0ShA+4ShB+s97                    0.1184674      b

attr(,"class")
[1] "group"

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se. the sammerized data will be used to create bar plot with standard error shown in the bars. the summerized data will be saved in a diiferent path as a csv file

if ("CellSurfaceExpression" %in% VariableToAnalyze){ #if the user wants to analyze this variable
Data1 = summarySE(data=CellsDataC,
             "CellSurfaceExpression",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame(PWCellSurfaceExpression$groups) # changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWCellSurfaceExpression$groups) #adding a column for treatment datai, for the next step
Data1 <- merge(Data1,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_1 <- ggplot(Data1,
            aes(x = Treatment, y = CellSurfaceExpression,
                ymax=2.0, ymin=0.0))  +
            geom_bar(stat="identity", fill="dark green",
colour = "dark green", width = 0.7) + geom_errorbar(aes(ymax=CellSurfaceExpression+se, ymin=CellSurfaceExpression-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=0.8, vjust=-4)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean CellSurfaceExpression-Clusters/Area size(um^2)")  +
             ggtitle("Normalized Mean CellSurfaceExpression in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_1+  annotate("text", label = "", y = 1.9,x=5, size = 3)) #setting the plot as needed
write.csv(Data1,"CellSurfaceExpressionSummery.csv", row.names = TRUE)
}

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se. the sammerized data will be used to create bar plot with standard error shown in the bars . the summerized data will be saved in a diiferent path as a csv file

if ("ClustersNumberPerCell" %in% VariableToAnalyze){ #if the user choose to analyze this variable, result shows
Data2 = summarySE(data=CellsDataC,
             "ClustersNumberPerCell",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWClustersNumberPerCell$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWClustersNumberPerCell$groups)#adding a column for treatmet datai, for the next step
Data2 <- merge(Data2,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_2 <- ggplot(Data2,
            aes(x = Treatment, y = ClustersNumberPerCell,
                ymax=1.0, ymin=0.0))  +
            geom_bar(stat="identity", fill="black",
colour = "black", width = 0.7) + geom_errorbar(aes(ymax=ClustersNumberPerCell+se, ymin=ClustersNumberPerCell-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=1.4, vjust=-3)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean ClustersNumberPerCell Per cell")  +
             ggtitle("Normalized Mean ClustersNumberPerCell in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_2+  annotate("text", label = "", y =0.7,x=5, size = 3))
write.csv(Data2,"ClustersNumberPerCellSummery.csv", row.names = TRUE)}

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se. the sammerized data will be used to create bar plot with standard error shown in the bars and also will be used for creating a boxplot for this variable. “Density” The summerized data will be saved in a diiferent path as a csv file.

if ("Density" %in%VariableToAnalyze){ #if the user wants to analyze this variable, the results show
Data3 = summarySE(data=Data2Handle,
             "Density",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data3 <- merge(Data3,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_3 <- ggplot(Data3,
            aes(x = Treatment, y = Density,
                ymax=2.0, ymin=0.0))  +
            geom_bar(stat="identity", fill= "dark blue",
colour = "blue", width = 0.7) + geom_errorbar(aes(ymax=Density+se, ymin=Density-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=1, vjust=-1)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean of Clusters Density (A.U/cluster area size(um^2))")  +
             ggtitle("Mean of Clusters Density in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_3+  annotate("text", label = "", y = 300000,x=5, size = 3))
write.csv(Data3,"DensitySummery.csv", row.names = TRUE)}

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se. the sammerized data will be used to create bar plot with standard error shown in the bars and also will be used for creating a boxplot for this variable. “Area” the summerized data will be saved in a diiferent path as a csv file

if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
Data4 = summarySE(data=Data2Handle,
             "Area",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWArea$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWArea$groups)#adding a column for treatmet datai, for the next step
Data4 <- merge(Data4,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8 # offsets for mean letters
plot_4 <- ggplot(Data4,
            aes(x = Treatment, y = Area,
                ymax=0.11, ymin=0.0))  +
            geom_bar(stat="identity", fill="gray",
colour = "black", width = 0.7) + geom_errorbar(aes(ymax=Area+se, ymin=Area-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=0.5, vjust=-1.5)) + 
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean of Clusters Area size(um^2)")  +
             ggtitle("Mean of Clusters Area Size in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_4+  annotate("text", label = "", y = 0.19,x=5, size = 3))
write.csv(Data4,"AreaSummery.csv", row.names = TRUE)}

Histograms plot for all treatment to see the relative frequency and the distribution of each treatment relativly to others- this is done in the same plot- Both variables are shown in those histograms “Density” and “Area”-VariableToAnalyze

if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
hist_allDensity <- ggplot(Data2Handle, aes(Density/100000, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = Treatment, fill = Treatment))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Density-Integrated Intinsity*10^5 ((A.U)/cluster area size(um^2))",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Density")
hist_allDensity}

if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
hist_allArea <- ggplot(Data2Handle, aes(Area, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = Treatment, fill = Treatment))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Mean of Clusters Area size(um^2)",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Area Size Mean(um^2)") 
hist_allArea}

just in case- another plot to see the variables

last plots are shown, boxplot, while the x-axis this time is shown as a numeric (Ratio of both variants shown as ShB/ShA).That is to see if there is a tendency/correlation between the transfiction ratio and the Density/Area)

if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
df5 <- Data3 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio))))))
# for the summery data of Density: a new column is added, a new presentation of the treatment as a ratio of the both variants
boxDen <- ggplot(df5, aes(x = ratio,fill = ratio)) +
  geom_boxplot(aes(group = ratio,
      lower = Density - se, 
      upper = Density + se, 
      middle = Density, 
      ymin = Density - 3*se, 
      ymax = Density + 3*se),
    stat = "identity") +
            labs(x = "Different ratio of Both Variants ShA and ShB (ShB/ShA)",
                 y = "Mean of Clusters Density (A.U/cluster area size(um^2))")  +
             ggtitle("Mean of Clusters Density as a function of ShB/ShA ratio")+ theme(legend.position="right")
boxDen
}

if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
df6 <- Data4 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio))))))
# for the summery data of Area: a new column is added, a new presentation of the treatment as a ratio of the both variants
#Plotting a boxplot of Area variable whereas the x-axis this time is a numeric variable, with the standard error shown
boxArea <- ggplot(df6, aes(x = ratio,fill = ratio)) +
  geom_boxplot(aes(group = ratio,
      lower = Area - se, 
      upper = Area + se, 
      middle = Area, 
      ymin = Area - 3*se, 
      ymax = Area + 3*se),
    stat = "identity") +
            labs(x = "Different Ratio of Both Variants ShA and ShB (ShB/ShA)",
                 y = "Mean of Cluster Area Size(um^2))")  +
             ggtitle("Mean of Cluster Area Size in Different Ratios of ShB/ShA variants") + theme(legend.position="right")
boxArea }

Density files

write.xlsx(Data2Handle[Data2Handle$Treatment=="4ShA+0ShB+s97",]$Area,"Area_4ShA+0ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="3ShA+1ShB+s97",]$Area,"Area_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="2ShA+2ShB+s97",]$Area,"Area_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="1ShA+3ShB+s97",]$Area,"Area_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="0ShA+4ShB+s97",]$Area,"Area_0ShA+4ShB+s97_sum.xlsx")

write.xlsx(Data2Handle[Data2Handle$Treatment=="4ShA+0ShB+s97",]$Density,"Density_4ShA+0ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="3ShA+1ShB+s97",]$Density,"Density_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="2ShA+2ShB+s97",]$Density,"Density_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="1ShA+3ShB+s97",]$Density,"Density_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="0ShA+4ShB+s97",]$Density,"Density_0ShA+4ShB+s97_sum.xlsx")


write.xlsx(CellsDataC[CellsDataC$Treatment=="0ShA+4ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_0ShA+4ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="1ShA+3ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="2ShA+2ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="3ShA+1ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="4ShA+0ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_4ShA+0ShB+s97_sum.xlsx")




write.xlsx(CellsDataC[CellsDataC$Treatment=="0ShA+4ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_0ShA+4ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="1ShA+3ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="2ShA+2ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="3ShA+1ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="4ShA+0ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_4ShA+0ShB+s97_sum.xlsx")


Data9 = summarySE(data=CellsDataC,
             "FractionofChannelsinClusters",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWFractionofChannelsinClusters$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWFractionofChannelsinClusters$groups)#adding a column for treatmet datai, for the next step
Data9 <- merge(Data9,datai[,-1], by="Treatment")
write.csv(Data9,"FractionofChannelsinClusters.csv", row.names = TRUE)

A simple linear regression will be performed in this chunk, testing for whether the expected values of the treatment predicts successfully the experimental observed means in each treatment. creating new summarized data if the variables Area/Density had been transformed

if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
  if ("Density.trans" %in% colnames(Data2Handle)){ #if the data for variable Density had been transformed, there is a a need to create a new summarized data frame with the transformed values- since linear regression assume normality and homogenty of variances
  Data5 = summarySE(data=Data2Handle,
             "Density.trans",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data5 <- merge(Data5,datai[,-1], by="Treatment")# assigning this into the summery data
names(Data5)[names(Data5) == "Density.trans"] <- "Density" #changing the name of the column, so the rest if thhe code could recognize both if the conditions- transformed/not transformed data
df5 <- Data5 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio)))))) #adding the ratio as number according to the name of the treatment in a new column.
}
equation1expD <- df5$ratio * df5$Density[5] + (1-df5$ratio)* df5$Density[1] #calculating the expected values for each treatment for the Density 
df5$equation1expD <- equation1expD # inserting these expected values to df5
modelD <- lm(Density~equation1expD, data=df5) #performing linear regression to check whether the expected values predict the response-the experimental data.
print(summary(modelD)) #showing the results
print(summ(modelD)) #showing the results in another way
effect_plot(modelD, pred = equation1expD, interval = TRUE, plot.points = TRUE) #presenting the result in a graphical way
}

Call:
lm(formula = Density ~ equation1expD, data = df5)

Residuals:
      1       2       3       4       5 
-3.6168  2.0537  0.1707  7.9646 -6.5722 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)   
(Intercept)   -22.2338    75.2391  -0.296   0.7869   
equation1expD   1.0646     0.1776   5.994   0.0093 **
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 6.428 on 3 degrees of freedom
Multiple R-squared:  0.9229,    Adjusted R-squared:  0.8972 
F-statistic: 35.92 on 1 and 3 DF,  p-value: 0.0093

MODEL INFO:
Observations: 5
Dependent Variable: Density
Type: OLS linear regression 

MODEL FIT:
F(1,3) = 35.92, p = 0.01
R² = 0.92
Adj. R² = 0.90 

Standard errors: OLS
----------------------------------------------------
                        Est.    S.E.   t val.      p
------------------- -------- ------- -------- ------
(Intercept)           -22.23   75.24    -0.30   0.79
equation1expD           1.06    0.18     5.99   0.01
----------------------------------------------------

if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
  if ("Area.trans" %in% colnames(Data2Handle)){ #if the data for variable Area had been transformed, there is a a need to create a new summarized data frame with the transformed values- since linear regression assume normality and homogeneity of variances
  Data6 = summarySE(data=Data2Handle,
             "Area.trans",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups) # changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data6 <- merge(Data6,datai[,-1], by="Treatment")# assigning this into the summery data
names(Data6)[names(Data6) == "Area.trans"] <- "Area" #changing the name of the column, so the rest if thhe code could recognize both if the conditions- transformed/not transformed data
df6 <- Data6 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio)))))) #adding the ratio as number according to the name of the treatment in a new column.
}
  equation1expA <- df6$ratio * df6$Area[5]  + (1-df6$ratio)* df6$Area[1] #calculating the expected values for each treatment for the Area
df6$equation1expA <- equation1expA # inserting these expected values to df6
modelA <- lm(Area~equation1expA, data=df6) #performing linear regression to check whether the expected values predict the response-the experimental data.
print(summary(modelA)) #showing the results
print(summ(modelA))#showing the results in another way
effect_plot(modelA, pred = equation1expA, interval = TRUE, plot.points = TRUE) #presenting the result in a graphical way
}

Call:
lm(formula = Area ~ equation1expA, data = df6)

Residuals:
        1         2         3         4         5 
-0.012415 -0.017718  0.055143 -0.007473 -0.017537 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)
(Intercept)     0.2654     2.2214   0.119    0.912
equation1expA   1.0817     0.7250   1.492    0.233

Residual standard error: 0.03593 on 3 degrees of freedom
Multiple R-squared:  0.4259,    Adjusted R-squared:  0.2346 
F-statistic: 2.226 on 1 and 3 DF,  p-value: 0.2325

MODEL INFO:
Observations: 5
Dependent Variable: Area
Type: OLS linear regression 

MODEL FIT:
F(1,3) = 2.23, p = 0.23
R² = 0.43
Adj. R² = 0.23 

Standard errors: OLS
-------------------------------------------------
                      Est.   S.E.   t val.      p
------------------- ------ ------ -------- ------
(Intercept)           0.27   2.22     0.12   0.91
equation1expA         1.08   0.73     1.49   0.23
-------------------------------------------------

new



hist_allDensity_p <- ggplot(Data2Handle, aes(Density/1000000, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = CellNumber, fill = CellNumber))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Density-Integrated Intinsity*10^6 ((A.U)/cluster area size(um^2))",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Density")
hist_allDensity_p

Here the project ends :)

---
title: "R Notebook"
output: html_notebook
---

Intoduction: Handling the files: reading multiple csv files into the same dataframe-called RawData.
Data will be organized into a specific template, some raws within the RawData will be extracted into a new dataframe - Those will be the data regarding the whole cells.
This small dataframe will be organized, some data will be normalized and some will be added-dataframe is called CellsDataC.
The rest, as well, will be assigned into a new dataframe- called Data2Handle: this data will be cleaned, reorganized and some rows will be eliminated-due to resolution issues.

```{r}
library(ggplot2)
library(dplyr)
library(tidyr)
library(tibble)
library(stringr)
library(reshape2)
library(rlist)
library(readr)
library(purrr)
library(dgof)
library(car)
library(Rmisc)
#library(agricolae)
library(jtools)
library(xlsx)
RawData <- list.files(full.names = TRUE) %>% 
  lapply(read_csv) %>% 
  bind_rows
#Reading multiple csv files into the same dataframe, with only one Head # when the file only contains the data and the R-notebook

Rdata <- RawData %>% drop_na(Label) #dropping NA rows
Rdata <- Rdata[,-c(1)] #deleting extra non-informative columns that appears while reading the csv file to the dataframe
Data2Handle <- Rdata%>%
  separate(Label,c("Treatment","CellNumber"),sep = "_") #%>% #seperating the label column into 2 diffrenet columns labeled as treatment and cell number- these two variables identify diffrernt cells within the same treatment, both are needed in the next steps of analysis

Data2Handle <- select(Data2Handle,-c(MinThr,MaxThr)) #deleting another 2 non-informative columns
Data2Handle[Data2Handle$Treatment=='ShA1/2+ShB1/2+s97',]$Treatment <- c('2ShA+2ShB+s97')
Data2Handle[Data2Handle$Treatment=='1ShA+1ShB+s97',]$Treatment <- c('2ShA+2ShB+s97')
Data2Handle[Data2Handle$Treatment=='sha34+shb14+s97',]$Treatment <- c('3ShA+1ShB+s97')
Data2Handle[Data2Handle$Treatment=='shb3+sha1+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA4+ShB0+s97',]$Treatment <- c('4ShA+0ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA0+ShB4+s97',]$Treatment <- c('0ShA+4ShB+s97')
Data2Handle[Data2Handle$Treatment=='shb34+sha14+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA3+ShB1+s97',]$Treatment <- c('3ShA+1ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA1+ShB3+s97',]$Treatment <- c('1ShA+3ShB+s97')
Data2Handle[Data2Handle$Treatment=='ShA2+ShB2+s97',]$Treatment <- c('2ShA+2ShB+s97')


CellsDataC <- Data2Handle[Data2Handle$Area>100,] # subsetting the data for the whole cell into a new dataframe, in each csv file there is one row for the whole cell- these data contains information about cells 
Data2Handle <- Data2Handle[Data2Handle$Area<100 ,] #eliminating the rows of data regarding the whole cells">300" and also the rows where the clusters area smaller than the resolution of the microscope
CellsData <-  Data2Handle %>%
  group_by(Treatment) %>%
  group_by(CellNumber,.add = TRUE) %>%
  dplyr::summarise(ClusterCount = n(),SumofRawIntDen=sum(RawIntDen),.groups = "keep") #grouping the cells data by treatment and cell number-both variables needed to identify unique cells from each other as written before- and then counting for each cell the cluster number(aka the rows in each file which belong to the specific cell)
CellsDataC <- merge(CellsDataC,CellsData, by=c("Treatment","CellNumber"))
#grouping the cells data by treatment and cell number-both variables needed to identify unique cells from each other as written before- and then counting for each cell the cluster number(aka the rows in each file which belong to the specific cell)
CellsDataC <- CellsDataC %>%
  mutate(ClustersNumberPerCell=ClusterCount/Area,
         non_normlizedClusterNumber=ClusterCount,
         CellSurfaceExpression=RawIntDen/Area,
         FractionofChannelsinClusters=SumofRawIntDen/RawIntDen)
CellsDataC <- CellsDataC[,-9]
#CellsDataC$ClustersNumberPerCell <- CellsData$ClusterCount/CellsDataC$Area
#CellsDataC$non_normlizedClusterNumber<- CellsData$ClusterCount#adding the vairable ClusterCount to the CellsDataC
#CellsDataC$CellSurfaceExpression <- CellsDataC$RawIntDen/CellsDataC$Area #adding the vairable CellSurfaceExpression to the CellsDataC- defined as the clusternumbers per cell normalized to the whole cell area (since a higher area of membrane will contain as well a higher number of clusters)  
#CellsDataC$FractionofChannelsinClusters=CellsData$SumofRawIntDen/CellsDataC$RawIntDen
#CellsDataC$SumofRawIntDen <- CellsData$SumofRawIntDen
#CellsDataC #showing the data to have a general look


Data2Handle <-Data2Handle %>%
  mutate(Density= RawIntDen/Area) # adding a new variable calculated by the area and RawIntDen variables in the data for each cluster in the cells, defined as Density
Data2Handle<- Data2Handle[,-9]#showing the data to have a general look
Data2Handle
CellsDataC

CellsData_sum <-  Data2Handle %>%
  group_by(Treatment) %>%
  group_by(CellNumber,.add = TRUE) %>%
  dplyr::summarise(ClusterCount = n(),MeanDen = mean(Density),MeanArea = mean(Area),Standardddensity= sd(Density),StandarddArea= sd(Area),.groups = "keep")
write.csv(CellsData_sum,"CellsData_sum.csv", row.names = TRUE)

write.xlsx(CellsDataC,"CellsDataC_sum.xlsx", row.names = TRUE)
```

User must give as an input the number of variables he wants to be analyzed and their exact names as they show up in the Data: Data2Handle or CellsDataC.
the input in the variable VariableToAnalyze will be used in next chunks to decide whether to run a specific commands or not-that is to run only the parts for the variables shows in VariableToAnalyze as the user decide However, rigth now this chunk is deactivated- and the value of VariableToAnalyze has been assigned as all the variables to be analyzed .

```{r}
numberofvar <- as.integer(readline(prompt=paste("How many you want to analyze:"))) #asking the user to decide how many variables he want to analyze 
numvec <- c(1:numberofvar) # creating vector of number according to the varaible number input in the previous command
inputvec <- function(num=0)#function to take multiple variable names
  #argument: the number of the variable to take as an input, to show to it to the user
  #value: the exact name of the variable to analyse -as it showes in the dataframe
{
  vec <- readline(prompt=paste("what is your", num, "variable exact name "))
  return(vec)
}
VariableToAnalyze <- sapply(numvec, inputvec)# vectorized command to take as an input the exact names of the variables to be analyzed
```

The next chuck will process the Data for the specific variables given in VariableToAnalyze: the function will perform Kolmogorov-Smirnov Normality test and Levene's test for homoscedasticity.
The result for the variables will be assigned in a new dataframe with the proper column names.
Note: True in the new dataframe-indicate4transformationAll stands for p-value<0.05 (aka the assumption is violated)

```{r}
DeterminePropTest <- function(VarName,Data)
  #description of the function: the function accept as an argumnt the variable name as it displayed in the data and check homogeneity of variance across groups and normality of residuals
  #argument-the name of variable to check and the data
  #value- vector of logical values (true stands for assumption violation) the 1st-homogeneity-levene's test, the 2nd- Kolmogorov-Smirnov Tests
{
names(Data)[names(Data) == VarName] <- "VarName" #in order to perform the test on a specific variable, changing the name of this variable to VarName- this make it easier to generalize the code, thus it could handle all the variable names as will be seen in next commands
  Model  = lm(VarName ~ Treatment,
              data=Data) #creating the model for Anova- defining the dependent and independent variables 
Anova(Model)   
summary(Model)
#all these parameters sets the model for further analysis
vec <- as.integer(readline(prompt=paste("Do you want to see the plots (Residuals Histogram and Residuals vs. predicted values) of the variable",VarName,"Yes-1, No-0 "))) #according to the user choice-shows or not-the plots to assess the data by himself in addition to the test carried out in the function
if (vec==1) 
  {hist(residuals(Model),col="darkgray")
#plot(fitted(Model),residuals(Model)) #plotting the fitted data to see The distribution of these residuals as it should be approximately normal.
qqnorm(Model$residuals)  # Another method of looking at the residuals normality
qqline(Model$residuals)# setting the normal line, in order to assist the residuals normality
pvaluelevene <- leveneTest(VarName ~ Treatment,data=Data,center=mean)[1,3] #assigning the p-value of levens test into a new variable
pvalueks <- ks.test(Model$residuals,pnorm,mean(Model$residuals),sd(Model$residuals))[[2]] #assigning the p-value of ks into a new variable
return (c(pvaluelevene<0.01,pvalueks<0.01)) #returning both values
}else
{pvaluelevene <- leveneTest(VarName ~ Treatment,data=Data,center=mean)[1,3]#assigning the p-value of levens test into a new variable
pvalueks <- ks.test(Model$residuals,pnorm,mean(Model$residuals),sd(Model$residuals))[[2]]#assigning the p-value of ks into a new variable
return (c(pvaluelevene<0.01,pvalueks<0.01))} #returning both values
}
#result4Transofrmation <- DeterminePropTest("ClustersNumberPerCell", CellsDataC)
#VariableToAnalyze <- c('ClustersNumberPerCell','Density','Area','CellSurfaceExpression')
#x <- VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell']
indicate4transformationW <- t(sapply(VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell'|VariableToAnalyze=='CellSurfaceExpression'|VariableToAnalyze=='FractionofChannelsinClusters'], DeterminePropTest, CellsDataC)) #subsetting the variables name belongging to the dataframe CellsDataC, on these variables, the function is called
indicate4transformationC <- t(sapply(VariableToAnalyze[VariableToAnalyze=='Area'|VariableToAnalyze=='Density'], DeterminePropTest, Data2Handle)) #subsetting the variables name belongging to the dataframe Data2Handle, on these variables, the function is called
# a result of True for both columns stands for a need of transformation in the variable- shows in name of the row 
if (length(indicate4transformationC)!=0){ # this condition is to handle a situation where is no variables given by the user in the second chunk
  colnames(indicate4transformationC) <- c("Levene's test", "Kolmogorov-Smirnov Tests") #setting the column names
  indicate4transformationC #showingg the results
  indicate4transformationC <- as.data.frame(indicate4transformationC) %>%
  add_column(VarN=VariableToAnalyze[VariableToAnalyze=='Area'|VariableToAnalyze=='Density'],.before=1) #adding a new column with the variable names
row.names(indicate4transformationC) <- NULL #resetting the the rownames
indicate4transformationC
}
 if (length(indicate4transformationW)!=0){ # this condition is to handle a sitution where is no variables given by the user in the second chunk 
  colnames(indicate4transformationW) <- c("Levene's test", "Kolmogorov-Smirnov Tests")#setting the column names
  indicate4transformationW #showingg the results
indicate4transformationW <- as.data.frame(indicate4transformationW) %>%
  add_column(VarN=VariableToAnalyze[VariableToAnalyze=='ClustersNumberPerCell'|VariableToAnalyze=='CellSurfaceExpression'|VariableToAnalyze=='FractionofChannelsinClusters'],.before=1) #adding a new column with the variable names
row.names(indicate4transformationW) <- NULL #resetting the the rownames
indicate4transformationW
}
indicate4transformationAll <- rbind.data.frame(indicate4transformationC,indicate4transformationW) #bindingg the dataframes togather
indicate4transformationAll
```

This chunk is responsible for calculating the transformation when ever needed.
the transformation type is determined by an Manual calibration of my own data behavior.
also here the transformation is carried out only when the data met the conditions to perform one (when both assumptions: normality and homoscedasticity, are violated) Note: I used the "-\>\>" whitch makes the data global, This allowed me to change and update directly both dataframes whenever i needed(CellsDataC/Data2Handle)

```{r}
trans.log <- function(x){return(log2(x))} #mathmatical function for transformation calculation
trans.sqrt <- function(x){return(sqrt(x+0.5))} #mathmatical function for transformation calculation
trans.receip <- function(x){return(1/(x))} #mathmatical function for transformation calculation
#old <- Data2Handle
#Data2Handle <- backup
TransUpdate <- function(VarRow)
  #function description: the function accept as arggument/input a row of dataframe, containeing 3 columns. the first row is the variable needed to tranform the second and the third are the result of levenes and ks test - True stands for a p-value lower than 0.05 (both true stands for a need of transformation of the data for this specific variable)
  #outpot is a dataframe updated with a column of transformated data of variable
{
  #VarRow <- indicate4transformation[2,]
  Data2Handle<<-Data2Handle # changing the dataframe of each cluster to be global/ also shows in the function itself
  CellsDataC<<-CellsDataC# changing the dataframe of the whole cells to be global/ also shows in the function itself
  if (VarRow[1] %in% c("Density","Area")) #checking if the variable belong to the data of clusters
  {
    if (VarRow[1]=="Density"){ #if the variable in the argument is density, then calculating the appropriate type of transformation for the density
      Data2Handle$Density.trans<<- sapply(Data2Handle$Density,trans.sqrt)
    } else{ #if the variable in the argument is Area, then calculating the appropriate type of transformation for the area
      Data2Handle$Area.trans<<- sapply(Data2Handle$Area,trans.log)}
  }else #otherwise, the variable belong to the whole cell data
  {
    if (VarRow[1]=='CellSurfaceExpression'){CellsDataC$CellSurfaceExpression.trans <<- sapply(CellsDataC$CellSurfaceExpression,trans.sqrt)} #if the variable in the argument is CellSurfaceExpression, then calculating the appropriate type of transformation for the CellSurfaceExpression
      else{CellsDataC$ClustersNumberPerCell.trans <<- sapply(CellsDataC$ClustersNumberPerCell,trans.receip)} #if the variable in the argument is ClustersNumberPerCell, then calculating the appropriate type of transformation for the ClustersNumberPerCell
  }
  
  CellsDataC<<-CellsDataC
  Data2Handle<<-Data2Handle 
}
if(!is.null(indicate4transformationAll)){
apply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T,],MARGIN=1,TransUpdate)} #applying the function on the data frame
#please pay attention, the apply is suitable for updating global variables
Data2Handle
CellsDataC

```

This chunk contain all the possible statistics test suitable for a numeric dependent variable and a independent variable with more then two levels.
This is needed since i have partial data for now, i don't know whether my data will violate both assumption in the future or not, so just in case.
The tests will be performed according to indicate4transformationAll results.
When the conditions aren't met, A Null dataframe will be sent to the test so no result will be shown.
As it could be seen my partial data will need ANOVA (after transformation/ with no transformation)

```{r}

welchtest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of welch test of the variable
{
  names(Data)[names(Data) == varname] <- "varname" #changing the column name-in the argumnets- to varname, this helps generlaizing the code, so it would run in each variable name
  oneway.test(varname ~ Treatment,
            data=Data,
            var.equal=FALSE) #performing the test
}
print("Welch test result if the conditions are met is below")
CWelch <- sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F & (indicate4transformationAll$VarN=='Area'|indicate4transformationAll$VarN=='Density'),1], welchtest, Data2Handle)
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F & (indicate4transformationAll$VarN=='CellSurfaceExpression'|indicate4transformationAll$VarN=='ClustersNumberPerCell'),1], welchtest, CellsDataC)
#both commands send the the two diffrenet Dataframe i have to the function for welch-test with the relavant variable name, only when the result of levens and Kolmogorov-Smirno is T,F respectivly - that is heteroscedasticity is found in the diffrent groups. this is done by sending the appropriate dataframe with the variable to test.
####################

KWtest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of Kruskal-walis test of the variable
{
  names(Data)[names(Data) == varname] <- "varname" #changing the column name-in the argumnets- to varname, this helps generlaizing the code, so it would run in each variable name
  kruskal.test(varname ~ Treatment,
                data = Data) #performing the test
}
print("Kruskal-Wallis test result if the conditions are met is below")
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T & (indicate4transformationAll$VarN=='Area'|indicate4transformationAll$VarN=='Density'),1], KWtest, Data2Handle)
sapply(indicate4transformationAll[indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T & (indicate4transformationAll$VarN=='CellSurfaceExpression'|indicate4transformationAll$VarN=='ClustersNumberPerCell'),1], KWtest, CellsDataC)
#both commands send the the two diffrenet Dataframe i have to the function for kruskal-wallis, only when the result of levens and Kolmogorov-Smirno is F,T respectivly - that is Normality assumption is violated. this is done each time by sending the appropriate dataframe with the variable to test.
#############


Anovatest <- function(varname, Data)
  # description: the function takes as an argument a variable that is found in the data, along with variable needed to be tested the function takes also the data itself.
  #output: Null, the function shows the result of Anova test of the variable
{
  names(Data)[names(Data) == varname] <- "varname"
  Model  = lm(varname ~ Treatment,
              data=Data) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis and Treatment respectively)
c <- c(unlist(anova(Model)[1,4:5]) )#
return(c)
}
#############

print("ANOVA test result if the conditions are met is below")
new4anova <- indicate4transformationAll[(indicate4transformationAll$`Levene's test`==F & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==F) |(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),] #subsetiing the variables suitable for ANOVA test (transformed data and data which is Normal and has homogeneity of variances), from the result dataframe of levens and Kolmogorov-Smirnov.

new4anova[(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),]$VarN <- sapply(new4anova[(indicate4transformationAll$`Levene's test`==T & indicate4transformationAll$`Kolmogorov-Smirnov Tests`==T),]$VarN,paste,sep=".","trans")
#updating the results dataframe first column when the variable in the row is transformated.

AnovaresC <- sapply(new4anova[(new4anova$VarN=="Density.trans" | new4anova$VarN=="Area.trans" |new4anova$VarN=="Density" | new4anova$VarN=="Area"),1],Anovatest, Data2Handle)
#appling ANOVA on the variables suited with the test in the Dataframe of the clusters
AnovaresW <- sapply(new4anova[(new4anova$VarN=="CellSurfaceExpression.trans" | new4anova$VarN=="ClustersNumberPerCell.trans"|new4anova$VarN=="CellSurfaceExpression" | new4anova$VarN=="ClustersNumberPerCell"),1],Anovatest, CellsDataC)
#appling ANOVA on the variables suited with the test in the Dataframe of whole cells

AnovaresC
AnovaresW




```

this

```{r}
#check differences in FractionofChannelsinClusters
AnovaresW_3 <- sapply("FractionofChannelsinClusters",Anovatest, CellsDataC)
AnovaresW_3

#check 
#ModelFraction  = lm(FractionofChannelsinClusters ~ Treatment,
#              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis and Treatment respectively)
#summary(ModelFraction)
```

In this chuck a Tuckey pairwise multiple comparison (post-hoc) is done for all the variables.

```{r}
library(agricolae)
#in all those commands a tukey multiple comparsion test is carried out
if ("Density" %in% VariableToAnalyze){
model  = lm(Density.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Density.trans and Treatment respectively)
PWDensity <- (HSD.test(model, "Treatment", group = F)) #tukey test
PWDensity}
if ("Area" %in% VariableToAnalyze){
Model  = lm(Area.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Area.trans and Treatment respectively)
PWArea <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWArea}
if ("ClustersNumberPerCell" %in% VariableToAnalyze){
Model  = lm(ClustersNumberPerCell ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-ClustersNumberPerCell and Treatment respectively)
PWClustersNumberPerCell <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWClustersNumberPerCell}
if ("CellSurfaceExpression" %in% VariableToAnalyze){
Model  = lm(CellSurfaceExpression ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-CellSurfaceExpression and Treatment respectively)
PWCellSurfaceExpression <- (HSD.test(Model, "Treatment", group = F)) #tukey test
PWCellSurfaceExpression}

new_model=lm(FractionofChannelsinClusters ~ Treatment,
              data=CellsDataC)
PWFractionofChannelsinClusters <- HSD.test(new_model, "Treatment", group = F)
PWFractionofChannelsinClusters
```

new try

```{r}
library(agricolae)
#in all those commands a tukey multiple comparsion test is carried out
if ("Density" %in% VariableToAnalyze){
model  = lm(Density.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Density.trans and Treatment respectively)
PWDensity <- (HSD.test(model, "Treatment", group = T)) #tukey test
PWDensity}
if ("Area" %in% VariableToAnalyze){
Model  = lm(Area.trans ~ Treatment,
              data=Data2Handle) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-Area.trans and Treatment respectively)
PWArea <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWArea}
if ("ClustersNumberPerCell" %in% VariableToAnalyze){
Model  = lm(ClustersNumberPerCell ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-ClustersNumberPerCell and Treatment respectively)
PWClustersNumberPerCell <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWClustersNumberPerCell}
if ("CellSurfaceExpression" %in% VariableToAnalyze){
Model  = lm(CellSurfaceExpression ~ Treatment,
              data=CellsDataC) #creating the model for Anova- defining the dependent and independent variables (Variable to analysis-CellSurfaceExpression and Treatment respectively)
PWCellSurfaceExpression <- (HSD.test(Model, "Treatment", group = T)) #tukey test
PWCellSurfaceExpression}

new_model=lm(FractionofChannelsinClusters ~ Treatment,
              data=CellsDataC)
PWFractionofChannelsinClusters <- HSD.test(new_model, "Treatment", group = T)
PWFractionofChannelsinClusters
```

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se.
the sammerized data will be used to create bar plot with standard error shown in the bars.
the summerized data will be saved in a diiferent path as a csv file

```{r}
if ("CellSurfaceExpression" %in% VariableToAnalyze){ #if the user wants to analyze this variable
Data1 = summarySE(data=CellsDataC,
             "CellSurfaceExpression",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame(PWCellSurfaceExpression$groups) # changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWCellSurfaceExpression$groups) #adding a column for treatment datai, for the next step
Data1 <- merge(Data1,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_1 <- ggplot(Data1,
            aes(x = Treatment, y = CellSurfaceExpression,
                ymax=2.0, ymin=0.0))  +
            geom_bar(stat="identity", fill="dark green",
colour = "dark green", width = 0.7) + geom_errorbar(aes(ymax=CellSurfaceExpression+se, ymin=CellSurfaceExpression-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=0.8, vjust=-4)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean CellSurfaceExpression-Clusters/Area size(um^2)")  +
             ggtitle("Normalized Mean CellSurfaceExpression in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_1+  annotate("text", label = "", y = 1.9,x=5, size = 3)) #setting the plot as needed
write.csv(Data1,"CellSurfaceExpressionSummery.csv", row.names = TRUE)
}
```

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se.
the sammerized data will be used to create bar plot with standard error shown in the bars .
the summerized data will be saved in a diiferent path as a csv file

```{r}
if ("ClustersNumberPerCell" %in% VariableToAnalyze){ #if the user choose to analyze this variable, result shows
Data2 = summarySE(data=CellsDataC,
             "ClustersNumberPerCell",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWClustersNumberPerCell$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWClustersNumberPerCell$groups)#adding a column for treatmet datai, for the next step
Data2 <- merge(Data2,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_2 <- ggplot(Data2,
            aes(x = Treatment, y = ClustersNumberPerCell,
                ymax=1.0, ymin=0.0))  +
            geom_bar(stat="identity", fill="black",
colour = "black", width = 0.7) + geom_errorbar(aes(ymax=ClustersNumberPerCell+se, ymin=ClustersNumberPerCell-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=1.4, vjust=-3)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean ClustersNumberPerCell Per cell")  +
             ggtitle("Normalized Mean ClustersNumberPerCell in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_2+  annotate("text", label = "", y =0.7,x=5, size = 3))
write.csv(Data2,"ClustersNumberPerCellSummery.csv", row.names = TRUE)}
```

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se.
the sammerized data will be used to create bar plot with standard error shown in the bars and also will be used for creating a boxplot for this variable.
"Density" The summerized data will be saved in a diiferent path as a csv file.

```{r}
if ("Density" %in%VariableToAnalyze){ #if the user wants to analyze this variable, the results show
Data3 = summarySE(data=Data2Handle,
             "Density",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data3 <- merge(Data3,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8
plot_3 <- ggplot(Data3,
            aes(x = Treatment, y = Density,
                ymax=2.0, ymin=0.0))  +
            geom_bar(stat="identity", fill= "dark blue",
colour = "blue", width = 0.7) + geom_errorbar(aes(ymax=Density+se, ymin=Density-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=1, vjust=-1)) +
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean of Clusters Density (A.U/cluster area size(um^2))")  +
             ggtitle("Mean of Clusters Density in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_3+  annotate("text", label = "", y = 300000,x=5, size = 3))
write.csv(Data3,"DensitySummery.csv", row.names = TRUE)}
```

Summerizing the data for the variable CellSurfaceExpression, calculation the standard error for a bar plot with se.
the sammerized data will be used to create bar plot with standard error shown in the bars and also will be used for creating a boxplot for this variable.
"Area" the summerized data will be saved in a diiferent path as a csv file

```{r}
if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
Data4 = summarySE(data=Data2Handle,
             "Area",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWArea$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWArea$groups)#adding a column for treatmet datai, for the next step
Data4 <- merge(Data4,datai[,-1], by="Treatment")# assigning this into the summery data
offset.v = -3 # offsets for mean letters 
offset.h = 0.8 # offsets for mean letters
plot_4 <- ggplot(Data4,
            aes(x = Treatment, y = Area,
                ymax=0.11, ymin=0.0))  +
            geom_bar(stat="identity", fill="gray",
colour = "black", width = 0.7) + geom_errorbar(aes(ymax=Area+se, ymin=Area-se),
width=0.2, size=0.5, color="black") + geom_text(aes(label=groups,
                      hjust=0.5, vjust=-1.5)) + 
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean of Clusters Area size(um^2)")  +
             ggtitle("Mean of Clusters Area Size in different ShB/ShA ratios") +
            theme_bw()  +
            theme(plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.border = element_rect(colour="black"),
                  panel.grid.major = element_blank(), 
                  panel.grid.minor = element_blank(),
                  panel.background = element_blank(), 
                  axis.line = element_line(colour = "black"))
print(plot_4+  annotate("text", label = "", y = 0.19,x=5, size = 3))
write.csv(Data4,"AreaSummery.csv", row.names = TRUE)}
```

Histograms plot for all treatment to see the relative frequency and the distribution of each treatment relativly to others- this is done in the same plot- Both variables are shown in those histograms "Density" and "Area"-VariableToAnalyze

```{r}
if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
hist_allDensity <- ggplot(Data2Handle, aes(Density/100000, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = Treatment, fill = Treatment))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Density-Integrated Intinsity*10^5 ((A.U)/cluster area size(um^2))",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Density")
hist_allDensity}
if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
hist_allArea <- ggplot(Data2Handle, aes(Area, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = Treatment, fill = Treatment))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Mean of Clusters Area size(um^2)",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Area Size Mean(um^2)") 
hist_allArea}
```

just in case- another plot to see the variables

```{r eval=FALSE, include=FALSE}

df4 <- Data2Handle %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio))))))
BoxbDensity <- ggplot(df4, aes(x=ratio, y=Density/1000000, fill = Treatment)) + 
  geom_boxplot() + theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Different ratio of Both Variants ShA and ShB (ShB/ShA)",
                 y = "Density-Integrated Intinsity*10^6 ((A.U)/cluster area size(um^2))")  +
             ggtitle("Clusters Density in Different Ratios of ShB/ShA variants")
BoxbDensity

BoxbArea <- ggplot(df4, aes(x=ratio, y=Area, fill = Treatment)) + 
  geom_boxplot() + theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Different ratio of Both Variants ShA and ShB",
                 y = "Mean of Cluster Area Size(um^2))")  +
             ggtitle("Mean of Cluster Area Size in Different Ratios of ShB/ShA variants")
BoxbArea

```

last plots are shown, boxplot, while the x-axis this time is shown as a numeric (Ratio of both variants shown as ShB/ShA).That is to see if there is a tendency/correlation between the transfiction ratio and the Density/Area)

```{r}
if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
df5 <- Data3 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio))))))
# for the summery data of Density: a new column is added, a new presentation of the treatment as a ratio of the both variants
boxDen <- ggplot(df5, aes(x = ratio,fill = ratio)) +
  geom_boxplot(aes(group = ratio,
      lower = Density - se, 
      upper = Density + se, 
      middle = Density, 
      ymin = Density - 3*se, 
      ymax = Density + 3*se),
    stat = "identity") +
            labs(x = "Different ratio of Both Variants ShA and ShB (ShB/ShA)",
                 y = "Mean of Clusters Density (A.U/cluster area size(um^2))")  +
             ggtitle("Mean of Clusters Density as a function of ShB/ShA ratio")+ theme(legend.position="right")
boxDen
}
if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
df6 <- Data4 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio))))))
# for the summery data of Area: a new column is added, a new presentation of the treatment as a ratio of the both variants
#Plotting a boxplot of Area variable whereas the x-axis this time is a numeric variable, with the standard error shown
boxArea <- ggplot(df6, aes(x = ratio,fill = ratio)) +
  geom_boxplot(aes(group = ratio,
      lower = Area - se, 
      upper = Area + se, 
      middle = Area, 
      ymin = Area - 3*se, 
      ymax = Area + 3*se),
    stat = "identity") +
            labs(x = "Different Ratio of Both Variants ShA and ShB (ShB/ShA)",
                 y = "Mean of Cluster Area Size(um^2))")  +
             ggtitle("Mean of Cluster Area Size in Different Ratios of ShB/ShA variants") + theme(legend.position="right")
boxArea }

```

Density files

```{r}
write.xlsx(Data2Handle[Data2Handle$Treatment=="4ShA+0ShB+s97",]$Area,"Area_4ShA+0ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="3ShA+1ShB+s97",]$Area,"Area_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="2ShA+2ShB+s97",]$Area,"Area_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="1ShA+3ShB+s97",]$Area,"Area_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="0ShA+4ShB+s97",]$Area,"Area_0ShA+4ShB+s97_sum.xlsx")

write.xlsx(Data2Handle[Data2Handle$Treatment=="4ShA+0ShB+s97",]$Density,"Density_4ShA+0ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="3ShA+1ShB+s97",]$Density,"Density_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="2ShA+2ShB+s97",]$Density,"Density_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="1ShA+3ShB+s97",]$Density,"Density_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(Data2Handle[Data2Handle$Treatment=="0ShA+4ShB+s97",]$Density,"Density_0ShA+4ShB+s97_sum.xlsx")


write.xlsx(CellsDataC[CellsDataC$Treatment=="0ShA+4ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_0ShA+4ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="1ShA+3ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="2ShA+2ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="3ShA+1ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="4ShA+0ShB+s97",]$FractionofChannelsinClusters,"FractionofChannelsinClusters_4ShA+0ShB+s97_sum.xlsx")




write.xlsx(CellsDataC[CellsDataC$Treatment=="0ShA+4ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_0ShA+4ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="1ShA+3ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_1ShA+3ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="2ShA+2ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_2ShA+2ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="3ShA+1ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_3ShA+1ShB+s97_sum.xlsx")
write.xlsx(CellsDataC[CellsDataC$Treatment=="4ShA+0ShB+s97",]$CellSurfaceExpression,"CellSurfaceExpression_4ShA+0ShB+s97_sum.xlsx")


Data9 = summarySE(data=CellsDataC,
             "FractionofChannelsinClusters",
             groupvars="Treatment",
             conf.interval = 0.95)#creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWFractionofChannelsinClusters$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWFractionofChannelsinClusters$groups)#adding a column for treatmet datai, for the next step
Data9 <- merge(Data9,datai[,-1], by="Treatment")
write.csv(Data9,"FractionofChannelsinClusters.csv", row.names = TRUE)
```

A simple linear regression will be performed in this chunk, testing for whether the expected values of the treatment predicts successfully the experimental observed means in each treatment.
creating new summarized data if the variables Area/Density had been transformed

```{r}
if ("Density" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
  if ("Density.trans" %in% colnames(Data2Handle)){ #if the data for variable Density had been transformed, there is a a need to create a new summarized data frame with the transformed values- since linear regression assume normality and homogenty of variances
  Data5 = summarySE(data=Data2Handle,
             "Density.trans",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups)# changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data5 <- merge(Data5,datai[,-1], by="Treatment")# assigning this into the summery data
names(Data5)[names(Data5) == "Density.trans"] <- "Density" #changing the name of the column, so the rest if thhe code could recognize both if the conditions- transformed/not transformed data
df5 <- Data5 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio)))))) #adding the ratio as number according to the name of the treatment in a new column.
}
equation1expD <- df5$ratio * df5$Density[5]	+ (1-df5$ratio)* df5$Density[1] #calculating the expected values for each treatment for the Density 
df5$equation1expD <- equation1expD # inserting these expected values to df5
modelD <- lm(Density~equation1expD, data=df5) #performing linear regression to check whether the expected values predict the response-the experimental data.
print(summary(modelD)) #showing the results
print(summ(modelD)) #showing the results in another way
effect_plot(modelD, pred = equation1expD, interval = TRUE, plot.points = TRUE) #presenting the result in a graphical way
}
if ("Area" %in% VariableToAnalyze){ #if the user wants to analyze this variable, the results show
  if ("Area.trans" %in% colnames(Data2Handle)){ #if the data for variable Area had been transformed, there is a a need to create a new summarized data frame with the transformed values- since linear regression assume normality and homogeneity of variances
  Data6 = summarySE(data=Data2Handle,
             "Area.trans",
             groupvars="Treatment",
             conf.interval = 0.95) #creatingg a new summery dataframe gor plotting purposes
datai <- as.data.frame.list(PWDensity$groups) # changing the tukey result for this variable into a dataframe- in order to extract the group from it, whitch shows wether two levels within the tested variable differ significatly
datai$Treatment <- row.names(PWDensity$groups)#adding a column for treatmet datai, for the next step
Data6 <- merge(Data6,datai[,-1], by="Treatment")# assigning this into the summery data
names(Data6)[names(Data6) == "Area.trans"] <- "Area" #changing the name of the column, so the rest if thhe code could recognize both if the conditions- transformed/not transformed data
df6 <- Data6 %>%
  add_column(.after=7,ratio = NA)%>%
  mutate(ratio= ifelse( Treatment =="4ShA+0ShB+s97",1,
                            ifelse(Treatment=="0ShA+4ShB+s97", 0,
                                   ifelse(Treatment=="2ShA+2ShB+s97",1/2,
                                          ifelse(Treatment=="1ShA+3ShB+s97",1/4,
                                                 ifelse(Treatment=='3ShA+1ShB+s97',3/4,ratio)))))) #adding the ratio as number according to the name of the treatment in a new column.
}
  equation1expA <- df6$ratio * df6$Area[5]	+ (1-df6$ratio)* df6$Area[1] #calculating the expected values for each treatment for the Area
df6$equation1expA <- equation1expA # inserting these expected values to df6
modelA <- lm(Area~equation1expA, data=df6) #performing linear regression to check whether the expected values predict the response-the experimental data.
print(summary(modelA)) #showing the results
print(summ(modelA))#showing the results in another way
effect_plot(modelA, pred = equation1expA, interval = TRUE, plot.points = TRUE) #presenting the result in a graphical way
}
```

new

```{r}


hist_allDensity_p <- ggplot(Data2Handle, aes(Density/1000000, ymax=1.0, ymin=0.0)) +
  facet_wrap(~Treatment, scales = 'free_x') +
  geom_histogram(aes(y = stat(count) / sum(count),color = CellNumber, fill = CellNumber))+  
            theme(legend.position="none", plot.title = element_text(size = rel(1.2),
                  face = "bold", vjust = 1.5, hjust = 0.5),
                  axis.title = element_text(face = "bold"),
                  axis.title.y = element_text(vjust= 1.8),
                  axis.title.x = element_text(vjust= -0.5),
                  axis.text.x=element_text( size=7),
                  panel.grid.major = element_blank(), 
                  panel.background = element_blank(),
                  plot.caption = element_text(hjust = 0.5),
                  axis.line = element_line(colour = "black"))+
            labs(x = "Density-Integrated Intinsity*10^6 ((A.U)/cluster area size(um^2))",
                 y = "Relative frequency")  +
             ggtitle("Relative frequency of Clusters Density")
hist_allDensity_p
```

Here the project ends :) 
